fine tune PER, fix bugs #131

kengz · 2017-04-26T04:42:29Z

Bug Fixes, Improvements

fix overflow error in np.exp in SoftmaxPolicy and BoltzmannPolicy by casting to float64 instead of float32
improve overall np.isfinite asserts
remove the leftover index column after reset in *analysis.csv
remove unused specs
reorganize and expand test specs
guard continuous action value range in continuous policies
fix analytics param variable sourcing
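
For reviewers, the overflow fix can be illustrated with a minimal sketch. It is not the code from this PR: the function name `boltzmann_probs` and the parameters `tau` and `clip_val` are hypothetical, and applying the clip to the temperature-scaled Q values is an assumption. The only details taken from the commit log are the cast to float64, the clip value of 200, and the np.isfinite assert.

```python
import numpy as np


def boltzmann_probs(q_values, tau=1.0, clip_val=200.0):
    """Hypothetical helper: action probabilities from Q values."""
    # Cast to float64: float32 np.exp overflows to inf for inputs above ~88.7,
    # while float64 stays finite up to ~709.
    q = np.asarray(q_values, dtype=np.float64) / tau
    # Assumption: clip the scaled values so exp() stays far from the limit.
    q = np.clip(q, -clip_val, clip_val)
    exp_q = np.exp(q)
    probs = exp_q / exp_q.sum()
    # Fail fast if anything non-finite slipped through.
    assert np.isfinite(probs).all(), 'non-finite action probabilities'
    return probs


probs = boltzmann_probs(np.array([1000.0, 0.0, -1000.0], dtype=np.float32))
```

With float32 and no clip, the same input would produce inf in np.exp and nan after normalization.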

DDPG

add EpsilonGreedyNoisePolicy
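
The PR text does not describe how EpsilonGreedyNoisePolicy behaves, so the following is only a guess at the general idea, combined with the continuous action range guard listed above. Everything here is an assumption: the function name `epsilon_noise_action`, the Gaussian noise, and the parameters `noise_std`, `low`, and `high`.

```python
import numpy as np


def epsilon_noise_action(action, epsilon, low, high, noise_std=0.1, rng=None):
    """Hypothetical sketch: perturb a continuous action with probability epsilon."""
    rng = np.random.default_rng() if rng is None else rng
    action = np.asarray(action, dtype=np.float64)
    # Assumption: explore by adding Gaussian noise, otherwise act greedily.
    if rng.random() < epsilon:
        action = action + rng.normal(0.0, noise_std, size=action.shape)
    # Guard the continuous action range before handing it to the environment.
    return np.clip(action, low, high)


greedy = epsilon_noise_action([0.5, 2.0], epsilon=0.0, low=-1.0, high=1.0)
noisy = epsilon_noise_action([0.5, 2.0], epsilon=1.0, low=-1.0, high=1.0,
                             rng=np.random.default_rng(0))
```

With `epsilon=0.0` the action is only clipped, so `greedy` is `[0.5, 1.0]`; with `epsilon=1.0` noise is always added, and the result still stays within `[low, high]`.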

PER

add memory.update(errors) throughout all agents
add shape assert for Q values and errors throughout
automatically set max_mem_len to max_timestep * max_epis / 3 if not specified
add the missing abs for the initial reward
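
A rough sketch of how these PER changes fit together, under stated assumptions. The class `TinyPrioritizedMemory`, its list-based storage, and the constants `e` and `alpha` are hypothetical and not taken from this PR. Seeding the initial priority from the absolute reward is one reading of the "abs for init reward" item, not a confirmed detail. Only the `update(errors)` call, the shape assert, and the max_mem_len default come from the description above.

```python
import numpy as np


class TinyPrioritizedMemory:
    """Hypothetical, list-based stand-in for a prioritized replay memory."""

    def __init__(self, max_timestep, max_epis, max_mem_len=None, e=0.01, alpha=0.6):
        if max_mem_len is None:
            # Default from the PR description: max_timestep * max_epis / 3
            max_mem_len = int(max_timestep * max_epis / 3)
        self.max_mem_len = max_mem_len
        self.e, self.alpha = e, alpha
        self.exps, self.priorities = [], []
        self.last_idxs = np.array([], dtype=int)

    def priority(self, error):
        # abs() keeps priorities positive even for negative errors or rewards
        return float((np.abs(error) + self.e) ** self.alpha)

    def add(self, exp, reward):
        if len(self.exps) >= self.max_mem_len:
            self.exps.pop(0)
            self.priorities.pop(0)
        self.exps.append(exp)
        # Assumption: the initial priority is seeded from the reward magnitude
        self.priorities.append(self.priority(reward))

    def sample(self, batch_size, rng=None):
        rng = np.random.default_rng() if rng is None else rng
        p = np.array(self.priorities, dtype=np.float64)
        p /= p.sum()
        self.last_idxs = rng.choice(len(self.exps), size=batch_size, p=p)
        return [self.exps[i] for i in self.last_idxs]

    def update(self, errors):
        # Called by the agent after training on the batch from sample()
        errors = np.asarray(errors, dtype=np.float64)
        assert errors.shape == self.last_idxs.shape, 'errors must match batch'
        assert np.isfinite(errors).all()
        for i, err in zip(self.last_idxs, errors):
            self.priorities[i] = self.priority(err)


mem = TinyPrioritizedMemory(max_timestep=200, max_epis=300)
for t in range(5):
    mem.add(exp=t, reward=-1.0)
batch = mem.sample(3, rng=np.random.default_rng(0))
mem.update(errors=[0.5, -2.0, 0.0])
```

The sketch assumes `update` is called right after `sample`, before any further `add`, so that the stored indices still point at the sampled experiences.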

kengz added 30 commits April 24, 2017 08:51

use per for mountain ac

32e0634

mountain per

aea3a3f

fix per, add missing memory update to ddpg

763f78e

add walker ddpg per

22c1a2d

size down per

fc40e88

narrow down params

a352c2e

per for dqn v1

9cd6f61

fix and generalize shape assert

eeba6df

fix assert in shape

4156956

remove offset in botlzman qstate

186cc08

import np in sarsa

13f6f6e

fix critic assert dim

d2d2a9c

clipval for boltzmann at 200

e67cced

guard overflow again

77311ef

restore underflow fix

ef40317

minor refactor

41f30a9

clear out unused specs

f4cc428

drop index col from csv

43d89ed

mute per test

7d5e692

fix sarsa test

668729a

boltzman fix overflow by np float64; remove offset minus

d7f5cec

schedule mountain dqn per

efa048e

auto memlen for PER as 1/3 epi * timestep

8781232

auto mem len for walker, use PER for lunar

071e13f

fix assert size for ddpg

0457845

reorganize tests

2ea4b34

add more tests

21d8578

mute atari to speed up test

96cfcbf

guard continuous action range in policy

193ab59

add dqn_per to start per testing

93dcb2d

kengz added 3 commits April 26, 2017 22:16

add epsilonnoise policy

9d4ccd3

fix analytics param sourcing

e9de40e

use default mem_len for mountain per

d54676e

kengz merged commit 4f123b8 into master Apr 27, 2017
